Approximability Distance in the Space of 
iJ-Colourability Problems 



Tommy Farnqvist, Peter Jonsson, and Johan Thapper 

Department of Computer and Information Science 
Linkopings universitet 
SE-581 83 Linkoping, Sweden 
{tomfa, pete j , johth}i§ida. liu. se 



Abstract. A graph homomorphism is a vertex map which carries edges 
from a source graph to edges in a target graph. We study the approx- 
imability properties of the Weighted Maximum H -Colourable Subgraph 
problem (Max H-Gol). The instances of this problem are edge- weighted 
graphs G and the objective is to find a subgraph of G that has maximal 
total edge weight, under the condition that the subgraph has a homo- 
morphism to H; note that for H = Kk this problem is equivalent to 
Max fe-cuT. To this end, we introduce a metric structure on the space 
of graphs which allows us to extend previously known approximability 
results to larger classes of graphs. Specifically, the approximation algo- 
rithms for Max cut by Goemans and Williamson and Max fc-cuT by 
Frieze and Jerrum can be used to yield non-trivial approximation results 
for Max H-Col. For a variety of graphs, we show near-optimality re- 
sults under the Unique Games Conjecture. We also use our method for 
comparing the performance of Frieze & Jerrum's algorithm with Has- 
tad's approximation algorithm for general Max 2-Csp. This comparison 
is, in most cases, favourable to Frieze & Jerrum. 
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1 Introduction 

Let G be a simple, undirected and finite graph. Given a subset S C V{G), 
a cut in G with respect to S is the edges from a vertex in S" to a vertex in 
V{G) \ S. The Max CUT-problem asks for the size of a largest cut in G. More 
generally, a fc-cut in G is the edges going from Si to Sj, i j, where Si, . . . ,Sk 
is a partitioning of V{G), and the Max fc-CUT-problem asks for the size of a 
largest fc-cut. The problem is readily seen to be identical to finding a largest 
fc-colourable subgraph of G. Furthermore, Max fc-cUT is known to be APX- 
complete for every fc > 2 and consequently does not admit a polynomial-time 
approximation scheme (Ptas). 



In the absence of a Ptas, it is interesting to determine the best possible 
approximation ratio c within which a problem can be approximated or, alterna- 
tively the smallest c for which it can be proved that no polynomial-time approx- 
imation algorithm exists (typically under some complexity-theoretic assumption 
such as P ^ NP). An approximation ratio of .878567 for Max cut was obtained 
in 1995 by Goemans and Williamson [15] using scmidcfinite programming. Frieze 
and Jerrum [14] determined lower bounds on the approximation ratios for Max 
fc-CUT using similar techniques. Sharpened results for small values of k have later 
been obtained by de Klerk et al. [9]. Under the assumption that the Unique 
Games Conjecture holds, Khot et al. [25] showed the approximation ratio for 
A; = 2 to be essentially optimal and also provided upper bounds on the approxi- 
mation ratio for fc > 2. Hastad [20] has shown that semidefinite programming is 
a universal tool for solving the general Max 2-Csp problem over any domain, 
in the sense that it establishes non-trivial approximation results for all of those 
problems. 

In this paper, we study approximability properties of a generalised version 
of Max fc-CUT called Max H-Col for undirected graphs H. Jonsson et al. [21] 
have shown that, when H is loop-free. Max H-Col does not admit a Ptas. Note 
that if H contains a loop, then Max H-Col is a trivial problem. We present 
approximability results for Max H-Col where H is taken from different families 
of graphs. Many of these results turns out to be close to optimal under the 
Unique Games Conjecture. Our approach is based on analysing approximability 
algorithms applied to problems which they are not originally intended to solve. 
This vague idea will be clarified below. 

Denote by G the set of all simple, undirected and finite graphs. A graph 
homomorphism h from G to H is a vertex map which carries the edges in G to 
edges in H. The existence of such a map will be denoted hy G ^ H. If both 
G ^ H and H G, the graphs G and H are said to be homomorphically 
equivalent. This equivalence will be denoted hy G = H. For a graph G G G, let 
W{G) be the set of weight functions w : E{G) ~> Q"*" assigning weights to edges 
of G. For a w G W(G), we let \\w\\ = J2eeE(G)'^^^) denote the total weight 
of G. Now, Weighted Maximum H-Colourable Subgraph (Max H-Col) is the 
maximisation problem with 

Instance: An edge-weighted graph (G, w), where G G G and w G W(G). 
Solution: A subgraph G' of G such that G' H. 
Measure: The weight of G' with respect to w. 

Given an edge- weighted graph (G, ui), denote by mcH{G,w) the measure of 
the optimal solution to the problem Max H-Col. Denote by m,Ck(G,w) the 
(weighted) size of the largest fc-cut in {G,w). This notation is justified by the 
fact that mck{G, w) = mcxk {G, w). In this sense. Max H-Col generalises Max 
fc-CUT. The decision version of Max ff-COL, the H-colouring problem has been 
extensively studied (Sec [17] and its many references.) and Hell and Ncsctfil [16] 
have shown that the problem is in P if iJ contains a loop or is bipartite, and 
NP-complete otherwise. Langberg et al. [27] have studied the approximability 



of Max H-Col when H is part of the input. We also note that Max H-Col is 
a speciahsation of the Max Csp problem. 

The homomorphism relation defines a quasi-order, but not a partial order 
on the set G- The failing axiom is that of antisymmetry, since G = H does not 
necessarily imply G = H. To remedy this, let Q= denote the set of equivalence 
classes of G under homomorphic equivalence. The relation is defined on G= in 
the obvious way and on this set it is a partial order. In fact, provides a lattice 
structure on Q= and this lattice will be denoted by Cs- For a more in-depth 
treatment of graph homomorphisms and the lattice Cs, see [17]. Here, we endow 
G= with a metric d defined in the following way: for M, N £ G, let 

d{M,N) = l- inf V^2M^. inf ^'^^f^^) (i) 
^^n{G,w) J^o^^mcM{GM 

In Lemma 5 we will show that d satisfies the following property: 

— Let M, N ^ G and assume that mcM can be approximated within a. Then, 
mcN can be approximated within (1 — d{M, N)) ■ a. 

Hence, we can use d for extending previously known approximability bounds on 
Max iJ-COL to new and larger classes of graphs. For instance, we can apply Goe- 
mans and Williamson's algorithm (which is intended for solving Max /■ir2-COL) 
to Max Cii-COL (i.e. the cycle on 11 vertices) and analyse how well the problem 
is approximated (we will see later on that Goemans and Williamson's algorithm 
approximates Max Ch-Col within 0.79869). 

In certain cases, the metric d is related to a well-studied graph parameter 
known as bipartite density b{H) [1, 3, 6, 18, 28]: if H' is bipartite subgraph of H 
with maximum number of edges, then 

urn - '^^'^ 
^^^^ - 

In the end of Section 2 we will see that b{H) = l — d{K2, H) for all edge-transitive 
graphs H. We note that while d is invariant under homomorphic equivalence, 
this is not in general true for bipartite density. 

The paper is divided into two main parts. Section 2 is used for proving the 
basic properties of d, showing that it is well-defined on G=, and that it is a metric. 
After that, we show that d is computable by linear programming and that the 
computation of d{M, N) can be simplified whenever M or A'' is edge-transitive. 
We conclude this part by providing some examples. 

The second part of the paper uses d for studying the approximability of 
Max i?-CoL. For several classes of graphs, we investigate optimiality issues by 
exploiting inapproximability bounds that arc consequences of the Unique Games 
Conjecture. Comparisons are also made to the bounds achieved by the general 
Max 2-CsP-algorithm by Hastad [20]. Our investigation covers a spectrum of 
graphs, ranging from graphs with few edges and/or containing large smallest 
cycles to graphs containing 0{rP') edges. Dense graphs are considered from two 
perspectives; firstly as graphs having a number of edges close to maximal and 



secondly as graphs from the Q{n,p) model of random graphs, pioneered by Erdos 
and Renyi [13]. 

The techniques used in this paper seem to generalise naturally to larger sets 
of problems. This and other questions are discussed in Section 4 which concludes 
our paper. 



2 Approximation via the Metric d 

In this section we start out by proving basic properties of the metric d, that 
{G=, d) is a metric space, and that proximity of graphs M, N in this space lets us 
interrelate the approximability of Max M-Col and Max N-COh. Sections 2.2 
and 2.3 are devoted to showing how to compute d. 



2.1 The Space {Q=,d) 

We begin by introducing a function s : Q x Q which enables us to express 
in a natural way and simplify forthcoming proofs. Let M, N €: G and define 

siM,N)= inf H^SM^^. (2) 

■ujew(s) jv V > ) 

The definition of d from (1) can then be written as follows: 

d{M,N) = \- s{N,M)-s{M,]<S). (3) 

A consequence of (2) is that the relation mcM(G, w) > s{M, N)-mcN{G,w) holds 
for all G G G and w € >V(G). Using this observation, we show that s{M, N) and 
thereby d{M, N) behaves well under graph homomorphisms and homomorphic 
equivalence. 

Lemma 1. Let M, N G G and M ^ N . Then, for every G G G and every weight 
function w € W{G), 

mcM{G,w) < mcN{G,w). 

Proof. If G' ^ M for some subgraph G' of G, then G' ^ N as well. The lemma 
immediately follows. □ 

Corollary 2. If M and N are homomorphically equivalent, thenmcM{G,w) = 
mcN{G,w). 

Corollary 3. Let Mi = M2 and Ni = N2 be two pairs of homomorphically 
equivalent graphs. Then, for k, I € {1, 2}, 

s{N,,M,)=s{Nk,Mi). 
Proof. Corollary 2 shows that for all G € G and w € W(G), we have 

mcM, (G, w) _ mcMi {G, w) 
mcNi{G,w) mcNkiGjw) 
Now, take the infimum over graphs G and weight functions w on both sides. □ 



Corollary 3 shows that s and d are well-defined as functions on the set G=- We 
can now show that d is indeed a metric on this space. 



Lemma 4. The pair {G=,d) forms a metric space. 

Proof. Positivity and symmetry follows immediately from the definition and the 
fact that s(M, N) < 1 for all M and N. Since s{M, N) = 1 if and only if TV ^ M, 
it also holds that d{M, iV) = if and only if M and are homomorphically 
equivalent. That is, d{M,N) = if and only if M and N represent the same 
member of G=. Furthermore, for graphs M, N and K G Q: 

s{M,N)-s{N,K)= inf ^^^jCM .^^ mc^iCw) 



'^sB rncN(G,w) "ee mcK(G,w) 

< inf ^-m{G,w) mc^iCw) 
™°w?s) mcNiG,w) mcK{G,w) 

Therefore, with a = s{M, N) ■ s{N, M), b = s{N, K) ■ s{K, N) and c = s(M, K) ■ 
s{K, M)>a-b, 

d{M, N) + d{N, K) - d{M, K) = l-a + l-h-{l-c)> 

> 1 - a - 5 + a • 6 = (1 - a) • (1 - 5) > 0, 

which sfiows that d satisfies the triangle inequality. □ 

We say that a maximisation problem 77 can be approximated within c < 1 
if there exists a randomised polynomial-time algorithm A such that c • Opt{x) < 
'E{A{x)) < Opt{x) for all instances x of 11 . Our next result shows that proximity 
of graphs G and H m d allows us to determine bounds on the approximability 
of Max H-Col from known bounds on the approximability of Max G-Col. 

Lemma 5. Let M,N,K he graphs. IfmcM can be approximated within a, then 
mcN can be approximated within a ■ {1 — d{M, N)). If it is NP-hard to approx- 
imate mcK within (3, then mcN is not approximate within 13/ (1 — d{N, K)) 
unless P= NP. 

Proof. Let A{G, w) be the measure of the solution returned by an algorithm 
which approximates mcM within a. We know that for s\\ G £ Q and w <E W(G) 
we have the inequalities mcjv(G,u') > s{N,M) ■ mcM{G,w) and mcM{G,w) > 
s{M,N) ■ mcN{G,w). Consequently, 

mcN{G, w) > mcM{G, w) ■ s{N, M) > A{G, w) ■ s(N, M) 

> mcM{G, w)-a - s{N, M) > mcN{G, w)-a- s{N, M) ■ s{M, N) 

= mcN{G, w)-a-{l- d{M, N)). 

For the second part, assume to the contrary that there exists a polynomial-time 
algorithm B that approximates m,cj^ within /?/(! — d(N,K)). According to the 
first part mcx can then be approximated within {l — d{N, K))-(3/ {l — d{N, K)) = 
p. This is a contradiction unless P = NP. □ 



2.2 Exploiting Symmetries 

We have seen that the metric (i(M, N) can be defined in terms of s(M, N). In fact, 
when M ^ N we have 1 - d{M, N) = s{M, N). It is therefore of interest to find 
an expression for s which can be calculated easily. After Lemma 6 (which shows 
how mcM{G,w) depends on w) we introduce a different way of describing the 
solutions to Max M-Col which makes the proofs of the following results more 
natural. In Lemma 7, we show that a particular type of weight function provides 
a lower bound on mcM{G,w)/mcNiG,w). Finally, in Lemma 8, we provide a 
simpler expression for s{M,N) which depends directly on the automorphism 
group and thereby the symmetries of N. This expression becomes particularly 
simple when N is edge-transitive. An immediate consequence of this is that 
s{K2,H) = b{H) for edge-transitive graphs H. 

The optimum mcH{G,w) is sub- linear with respect to the weight function, 
as is shown by the following lemma. 

Lemma 6. Let G,H G Q, a G Q+ and let w,w\, . . . ,Wr € W{G) he weight 
functions on G. Then, 

— mcH (G.a ■ w) = a ■ m.CH {G,w), 

- mcHiG,J2'i=iWi) < J2i=i'^CH{G,Wi). 

Proof. The first part is trivial. For the second part, let G' be an optimal solution 
to the instance (G, J2l^i Wi) of Max H-Col. Then, the measure of this solution 
equals the sum of the measures of G' as a (possibly suboptimal) solution to each 
of the instances {G,Wi). □ 

An alternative description of the solutions to Max H-COh is as follows: let G and 
H &g, and for any vertex map / : V{G) V{H), let f* : E{G) E{H) be the 
(partial) edge map induced by /. In this notation h : V{G) —^ V{H) is a graph 
homomorphism precisely when Qv^)^'^ {E{H)) = E{G) or, alternatively when h"^ 
is a total function. The set of solutions to an instance {G,w) of Max H-Col 
can then be taken to be the set of vertex maps / : V{G) y{H) with the 
measure 

^(/) = Yl ^(^)- 

ee{f*)-^E{H)) 

In the remaining part of this section, we will use this description of a solution. Let 
/ : V{G) V{H) be an optimal solution to the instance (G, w) of Max H-Col. 
Define the weight Wf G W{H) as follows: for each e G E{H), let 



We now prove the following result: 

Lemma 7. Let M, N <E Q he two graphs. Then, for every G <E Q , every w G 
W(G), and any optimal solution f to (G, li;) o/ Max iV-CoL, it holds that 




mcM{G, w) 
mcjv(G, w) 



> mcM{N, Wf). 



Proof. Arbitrarily choose an optimal solution g : V{N) V{M) to the instance 
{N,Wf ) of Max M-Col. Then, 5 o / is a solution to {G,w) as an instance of 
Max Af-COL. The weight of this solution is mcM{N,Wf) ■ mcj^{G,'w), which 
implies that 

mcM{G,w) > mcMiNjWf) ■ mcNlGjw), 

and the result follows after division by ■mcN{G,w). □ 

Let M and N ^ Q he graphs and let A = Aut(A^) be the automorphism 
group of N. We will let tt e ^ act on {u, v} G E{N) by tt • {u, v} = {7r(M), ^{v)}. 
The graph N is edge-transitive if and only if A acts transitively on the edges of 
N. Let VV(A^) be the set of weight functions w G W{N) which satisfy = 1 
and for which w(e) = 'w{Tr ■ e) for all e G E{N) and tt G Aut(A7'). 

Lemma 8. Let M,N G Q. Then, 
s{M,N)= inf mcM{N,w). 

In particular, when N is edge-transitive, 

s{M, N) = mcMiN, l/e{N)). 
Proof. The easy direction goes through as follows: 

s{M,N)< inf "^""MiN w) ^ .^^ mCM{N,w). 

w&w(N) mcN(Jy,w) weWiN) 

For the first part of the lemma, it will be sufficient to prove that the following 
inequality holds for for some w' gW. 

mcM(G,w) ^ 1^ 

> mcM{N, w') 4 

mcN{G,w) 

Taking the infimum over graphs G and weight functions w G W(G) in the left- 
hand side of this inequality will then show that 

s{M,N) > mcM{N,w') > inf mcM{N,w). 

wew{N) 

Let A = Aut(-A'') be the automorphism group of N. Let tt G ^ be an arbitrary 
automorphism of A''. If / is an optimal solution to {G,w) as an instance of 
Max A''-Col, then so is /jr = tt o /. Let — Wtto/- By Lemma 7, inequality 
(4) is satisfied by w^. Summing tt in this inequality over A gives 

mcM{G,w) ^ /Ar ^ 

\^\ 77^ — 7^/ 'rncM(N,w^) >mcM{N, } w^j, 

mcN(G,w) ^ 

where the last inequality follows from Lemma 6. The weight function X^^g^w^r 
can be determined as follows. 



(/#)-i(Ae) 



w{e') 



mcN{G,w) \Ae\ mcN{G,w) 



where Ae denotes the orbit of e under A. Thus, w' ^j^^aW-k/\A\ G W{N) and 
w' satisfies (4) so the first part follows. 

For the second part, note that when the automorphism group A acts transitively 
on E{N), there is only one orbit Ae = E{N). Then, the weight function w' is 
given by 

/. X _ 1 _ J2e'e{f*)-HE{N)) ^'(sO _ 1 ^ mcN{G,w) 
^^^'~e{N)' mcN{G,w) ~ e{N) ' mcN{G,w)' 

□ 



2.3 Tools for Computing Distances 

From Lemma 8 it follows that in order to determine s{M,N), it is sufficient to 
minimise mcM{N, w) over VV{N). We will now use this observation to describe a 
linear program for computing s{M. N). For z G {1, . . . , r}, let ^4^ be the orbits of 
Aut(A'') acting on E{N). The measure of a solution / when w G W{N) is equal 
to J2l^i Wi- fi, where Wi is the weight of an edge in Ai and fi is the number of 
edges in Ai which are mapped to an edge in M by /. Note that given a w, the 
measure of a solution / depends only on the vector (/i, . . . , f^) G N''. Therefore, 
take the solution space to be the set of such vectors: 

F = {{fi,..., fr) I / is a solution to (TV, w) of Max M-Col } 

Let the variables of the linear program he wi, . . . ,Wr and s, where wi represents 
the weight of each element in the orbit Ai and s is an upper bound on the 
solutions. 

min s 

J2ifi-Wi<s for each G F 

Wi, s > 

Given a solution Wi, s to this program, w{e) = Wi when e G is a weight 
function which minimises mcM{G, w). The value of this solution is s = s{M, N). 

Example 9. The wheel graph on k vertices, Wk, is a graph that contains a cycle 
of length k—1 plus a vertex v not in the cycle such that v is connected to every 
other vertex. We call the edges of the k — 1-cycle outer edges and the remaining 
k — 1 edges spokes. It is easy to see that Wk contains a maximum clique of size 
4 if A; = 4 (in fact, W4 = K4) and a maximum clique of size 3 in all other 
cases. Furthermore, is 3-colourable if and only if k is odd, and 4-colourable 
otherwise. This implies that for odd k, the wheel graphs are homomorphically 
equivalent to K^. 

We will determine s{K7i, Wn) for even n > 6 using the previously described 
construction of a linear program. Note that the group action of Aut(H^„) on 
E{Wn) has two orbits, one which consists of all outer edges and one which 
consists of all the spokes. If we remove one outer edge or one spoke from Wk, 



then the resulting graph can be mapped homomorphically onto K3. Therefore, 
it suffices to choose F = {f,g} with / = {k — l,k — 2) and g = {k — 2,k — 1) 
since all other solutions will have a smaller measure than at least one of these. 
The program for Wk looks like this: 



(fc - 1) • Wi + (fc - 2) • 102 < s 

{k -2) ■ wi + {k - 1) ■ W2 < s 

{k-l)-Wi + {k-l)-W2 = l 

Wi, s > 

The solution is wi = W2 = l/{2k - 2) with s{K3, Wk) = s = {2k - 3)/{2k - 2). 

Example 10. An example where the weights in the optimal solution to the linear 
program are different for different orbits is given by s(X2, ii'g/a). The rational 
complete graph ifg/s has vertex set {0, 1, . . . , 7}, which is thought of as placed on 
a circle with following 7. There is an edge between any two vertices which are 
at a distance at least 3 from each other. Each vertex has distance 4 to exactly 
one other vertex, which means there are 4 such edges. These edges form one orbit 
Ai and the remaining 8 edges form the other orbit A2. There are two maximal 
solutions, / = (0, 8) and g = (4, 6) which gives the program: 

min s 

■ Wi + 8 ■ W2 < s 

A - wi + 6 ■ W2 < s 
4 • wi + 8 • W2 = 1 

Wi, s > 

The solution to this program is wi = 1/20 and W2 = 1/10 with the optimum 
being 4/5. 

In some cases, it may be hard to determine a desired distance between H and 
M or N. If we know that H is homomorphically sandwiched between M and A'' 
so that M ^ H ^ N, then we can provide an upper bound on the distance of 
if to M or A^ by using the distance between M and N. Formally, we have: 

Lemma 11. Let M ^ H ^ N . Then, 



s{M, H) > s{M, N) and s{H, N) > s{M, N). 

Proof. Since H ^ N, it follows from Lemma 1 that mcH{G,w) < mcN{G,w). 
Thus, 



mm s 



s{M, H) 



. ^ mcM{G,w) 

mf — — - 

i^fo) mcH{G,w) 




mcM{G,w) 



= s{M, N). 




The second part follows similarly. 



□ 



We will see several applications of this lemma in Sections 3.1 and 3.2. 



3 Approximability of Max H-Col 



Let A be an approximation algorithm for Max H-Col. Our method basically 
allows us to measure how well A performs on other problems Max H'-Col. 
In this section, we will apply the method to various algorithms and various 
graphs. We do two things for each kind of graph under consideration: compare 
the performance of our method with that of some existing, leading, approxima- 
tion algorithm and investigate how close to optimality we can get. Our main 
algorithmic tools will be the following: 

Theorem 12 (Goemans and Williamson [15]). mc2 can be approximated 
within 

6/ IT 

OLGW = min — — — - w .878567. 

o<9<7r (l-cos6')/2 

Theorem 13 (Frieze and Jerrum [14]). mch can be approximated within 
1 21nA; 

Here, the relation ~ indicates two expressions whose ratio tends to 1 as A; ^ oo. 
We note that de Klerk et al. [9] have presented the sharpest known bounds on 
ak for small values of fc; for instance, > 0.836008. 

Let v{G),e{G) denote the number of vertices and edges in G, respectively. 
Hastad has shown the following: 

Theorem 14 (Hastad [20]). Let H be a graph. There is an absolute constant 
c > such that mcn can be approximated within 

(P ^ d?\ogd' 

where d = v{H) and t{H) = rf^ - 2 • e{H). 

We will compare the performance of this algorithm on Max H-Col with the 
performance of the algorithms in Theorems 12 and 13 analysed using Lemma 5 
and estimates of the distance d. This comparison is not entirely fair since Has- 
tad's algorithm was probably not designed with the goal of providing optimal 
results — the goal was to beat random assignments. However, it is the currently 
best algorithm that can approximate Max H-Col for arbitrary H G Q. For this 
purpose, we introduce two functions, FJk and Ha, such that, if is a graph, 
FJk{H) denotes the best bound on the approximation guarantee when Frieze 
and Jerrum's algorithm for Max fc-CUT is applied to the problem mcH, while 
Ha{H) is the guarantee when Hiistad's algorithm is used to approximate rn.CH- 
To be able to investigate the eventual near-optimality of our approxima- 
tion method we will rely on the Unique Games Conjecture (UGC). Khot [24] 
suggested this conjecture as a possible direction for proving inapproximability 
properties of some important constraint satisfaction problems over two variables. 
We need the following problem only for stating the conjecture: 



Definition 15. The Unique Label Cover problem C(y,W, E,[M], {Tr^'^}(^y^)(zE) 
is the following problem: Given is a bipartite graph with left side vertices V , right 
side vertices W , and a set of edges E . The goal is to assign one 'label' to every 
vertex of the graph, where [M] is the set of allowed labels. The labelling is sup- 
posed to satisfy certain constraints given by bijective maps (T,,_u, : [M\ [M]. 
There is one such map for every edge {v,w) £ E. A labelling 'satisfies' an edge 
{v,w) if av,wO-shel{w)) = label(f). The optimum of the unique label cover prob- 
lem is defined to be the maximum fraction of edges satisfied by any labelling. 

Now, UGC is the following: 

Conjecture 16 (Unique Games Conjecture). For any 77,7 > 0, there exists a 
constant M = M{ri, 7) such that it is NP-hard to distinguish whether the Unique 
Label Cover problem with label set of size M has optimum at least 1 — ry or at 
most 7. 

From hereon we assume that UGC is true, which gives us the following inap- 
proximability results: 

Theorem 17 (Khot et al. [25]). 

— For every e > 0, it is NP-hard to approximate mc2 within aaw + £• 

— It is NP-hard to approximate mck within{l—l/k+{2\uk)/k^-\-0{(\n.\n.k)/k^)). 

3.1 Sparse Graphs 

In this section, we investigate the performance of our method on graphs which 
have relatively few edges, and we see that the girth of the graphs plays a central 
role. The girth of a graph is the length of a shortest cycle contained in the graph. 
Similarly, the odd girth of a graph gives the length of a shortest odd cycle in the 
graph. 

Before we proceed we need some facts about cycle graphs. Note that the odd 
cycles form a chain in the lattice Cs between K2 and C3 = in the following 
way: 

— * • • • — > C2i+\ — > C2i-\ = K^. 

The following lemma gives the values of s(M, N) for pairs of graphs in this chain. 
The value depends only on the target graph of the homomorphism. 

Lemma 18. Let k < m be positive, odd integers. Then, 

s{K2,Ck)=s{Cm,Ck) = ^. 

Proof. Note that C2k+i -h -^2 and C2fc+i 7^ C2m-\-i- However, after remov- 
ing one edge from C2k+i, the remaining subgraph is isomorphic to the path 
-P2fc+i which in turn is embeddable in both K2 and C2to+i- Since C2fc+i is edge- 
transitive, the result follows from Lemma 8. □ 



With Lemma 18 at hand, we can prove the following: 



Proposition 19. Let k > 3 be odd. Then, FJ2(Ck) > 'Oigw o-nd Ha{Ck) = 
f k'^ log fc ~ k^Togk- Furthermore, mcc^. cannot he approximated within -^^j • 
otGW + ^ for any e > 0. 

Proof. From Lemma 18 we see that s{K2,Ck) = which impUes (using 

Lemma 5) that FJ2{Ck) > • acw- Furthermore, mc2 cannot be approx- 
imated within acw + for any e' > 0. From the second part of Lemma 5, we 
get that mcc,^ cannot be approximated within ■ {oigw -'f^') for any e' . With 
e' = e ■ the result follows. 
Finally, we see that 

„ , , / c \ ck + 2P log k- 2c 

Ha{Ck) = 1 To— • 1 - 



fc^ \ fc2 log k J k^ log k 



2 c 2c 

= T + 



k k'^ log k k^ log k ' 

Hastad's algorithm does not perform particularly well on sparse graphs; this 
is reflected by its performance on cycle graphs Ck where the approximation 
guarantee tends to zero when k ^ oo. We will see that this trend is apparent 
for all graph types studied in this section. 

Now we can continue with a result on a class of graphs with large girth: 

Proposition 20. Let m > k > A. If H is a graph with odd girth g > 2k + 1 

and minimum degree > ^q^ry, then FJ2{H) > 2k+i ' '^Gw and men cannot he 
approximated within ^^^^ • aaw + £ for any £ > 0. Asymptotically, Hd{H) is 
hounded hy^^ + ^-^^^ - ^""Vj^r-^'^ , where n = v[H). 

Proof. Lai & Liu [26] have proved that if i? is a graph with odd girth > 2fc+l and 
minimum degree > ^^^jj, then there exists a homomorphism from H to C2k+i- 
Thus, K2 H ^ C2/C+1 which implies that 1 - d{K2, H)>1- d{K2, C2k+i) = 
2!^. By Lemma 5, F,J2{H) > 2k+i ' '^GW, but mcH cannot be approximated 
within • aaw + £ for any £ > 0. 

Button and Brigham [10] show that one upper bound on e{H) has asymptotic 
order n^+^/(f~^). This lets us say that 

, , n2 - 2 • ni+2/(9-i) / 
Hd{H) ~ 1 5 1 - 



log n 

_ cn? + 2n(39-i)/(s-i) logn - 2n^B+^y^B-^^c _ 
log n 

_ c 2(n9/(9~i))3 2n<^'^s--^)n^/{g-^)c 

n^logn n^n}/ia-^) n^logn 



□ 



If we restrict ourselves to planar graphs, then it is possible to show the following: 



Proposition 21. Let H be a planar graph with girtli at least g = ^"3 ^ . // 
v{H) = n, then FMH) >^,-aGw and H&{H) < f - ^ + - + 

raMog n • ''^^H Cannot he approximated within • Q-gw + £ /or anj/ £ > 0. 

Proof. Borodin et al. [7] have proved that is (2 + ^)-colourable which is 
equivalent to saying that there exists a homomorphism from H to C2k+i- The 
proof proceeds as for Proposition 20. 

The planar graph H cannot have more than 3n— 6 edges so Ha{H) is bounded 
from above by 

^ _ - 2(3n - 6) _ / _ c \ ^ 
V? \ V? log n J 

cv? — 6nc + 12c + 6n^ log n — 12n^ log n 
log n 

_ 6 12 c 6c 12c 

n log n log n log n 

(In fact, if contains no more than max{g{n — 2)/{g — 2),n — 1} edges, but using 
this only makes for a more convoluted expression to study.) □ 

Proposition 21 can be strengthened and extended in different ways: one is to 
consider a result by Dvorak ct al. [11]. They have proved that every planar graph 
H of odd-girth at least 9 is homomorphic to the Petersen graph P. The Petersen 
graph is edge-transitive and it is known (cf. [3]) that the bipartite density of P is 
4/5 or, in other words, s{K2, P) = 4/5. Consequently, m,CH can be approximated 
within I • aaw but not within | • acw + £ for any e > 0. This is better than 
Proposition 21 for planar graphs with girth strictly less than 13. 

Another way of extending Proposition 21 is to consider graphs embeddable 
on higher-genus surfaces. For instance, the lemma is true for graphs embeddable 
on the projective plane, and it is also true for graphs of girth strictly greater 
than ^°3~^ whenever the graphs arc embeddable on the torus or Klein bottle. 
These bounds are direct consequences of results in Borodin et al. 

We conclude the section by looking at a class of graphs that have small 
girth. Let < /3 < 1, be the approximation threshold for TOC3, i.e. mca is 
approximable within /3 but not within (3 + e for any e > 0. Currently, we know 
that as < 0.836008 < /3 < j^l [9,22]. The wheel graphs Wu from Example 9 
are homomorpliically equivalent to K-i, for odd k and we conclude (by Lemma 5) 
that mcwk has the same approximability properties as mcs in this case. For even 
k>6, Wk is not homomorphically equivalent to K3, though. 

Proposition 22. For k > Q and even, FJ^iWk) > 0:3 • ^"^t mcwk is not 

approximable within (3 ■ §e|. Ha{Wk) = | - ^ + wtik ' + 

Proof. We know from Example 9 that Ks Wk and s{K3,Wk) = ||e|. The 
first part of the result follows by an application of Lemma 5. 

Ha{Wk) = 1-^.(1- =/d = k, e{Wk) = 2{k - 1)/ = 



_^ fc^-4(fc-l) 



fc^c + 4fc^ log - Akc - 4fc^ log k + Ac 
fc^ log A; 

4 4c 4c 4c 
+ ttt; r - 1^-; r + 



k fc^ fc^ log k k^ log A; log k 

□ 

We see that FJsCWk) — > as when fc — > oo, while i?d(Wfe) tends to 0. 
3.2 Dense and Random Graphs 

We will now study dense graphs, i.e. graphs H containing 0{v{H)^) edges. For a 
graph H onn vertices, we obviously have H Kn- If we assume that i^{H) > r, 
then we also have Kr H. Thus, if we could determine s{Kr, Kn), then we could 
use Lemma 11 to calculate a bound on FJn{H). 

Let oj{G) denote the size of the largest clique in G and x(G) denote the chro- 
matic number of G. The Turan graph T(n, r) is a graph formed by partitioning 
a set of n vertices into r subsets, with sizes as equal as possible, and connecting 
two vertices whenever they belong to different subsets. Turan graphs have the 
following properties [31]: 

- e(T(n,r)) = L(l-i)-^J; 

- u{T{n, r)) = x(T(n, r)) = r; 

- if G is a graph such that e{G) > e{T{v{G),r)), then uj{G) > r. 
Lemma 23. Let r and n be positive integers. Then, 

s{Kr,Kn)=e{T{n,r))/e{Kn) 

Proof. Since Kn is cdgc-transitivc, it sufHces to show that mCr{Kn, l/e{Kn)) = 
e{T{n,r))/e{Kn). Assume mcr(-ftr„, l/e(if„)) • e{Kn) > e{T{n,r)). This implies 
that there exists an r-partite graph G on fc vertices with strictly more than 
e(r(n,r)) edges — this is impossible since uo{G) > r and, consequently, x(G) > 
r. Thus, mcK^{Kn, l/e{Kn)) ■ e{Kn) = e{T{n, r)) because T{n, r) is an r-partite 
subgraph of Kn . □ 

Now, we are ready to prove the following: 
Proposition 24. Let v{H) = n and pick r S N, cr € M such that 



Then, 

2 

FJn{H) >an-- 
Ha{H) =2a + 



<a-r? = eiH) < 



^11 21nn 
n • (n — 1) r n n{n — 1) 

C 2cr • C 



log n rfi log n 



Proof. We have Kr H due to Turan and H K„ holds trivially since 
v{H) = n. By Lemma 23 



2 

s{Kr,Kn) = - 



1 



n • (n — 1) 

The first part of the result follows from Lemma 5 since d{H, Kn) < d{Kr, Kn) = 
1 — s{Kr,Kn) and some straightforward calculations. 



9 9 

n — a ■ n 



Ha{H) = 1 2 1 



'n? log n 



c + 2(7 • log n — 2a-c c 2c7 • c 

^ - +2(7 



log n log n log n ' 

□ 

Note that when r and n grow, FJn(H) tends to 1. This means that, asymptoti- 
cally, we cannot do much better. If we compare the expression for FJ„(iJ) with 
the inapproximability bound for mc„ (Theorem 17), we sec that all we could hope 
for is a faster convergence towards 1. As cr satisfies (l — • ^ < cr < (l ^ ' 
we conclude that Ha{H) also tends to 1 as r and n grow. To get a better grip 
on how Ha{H) behaves we look at two extreme cases. 
For a maximal cr = (l — i) • ^, Ha{H) becomes 

^-- + —^ • 

n n'^ log n 

On the other hand, this guarantee, for a minimal cr = (l — i) • ^ is 

1-1+ ' 



r ' rn^ log n 

At the same time, it is easy to see that Frieze and Jerrum's algorithm makes 
these points approximable within an (since, in this case, H = Kn) and (since 
Turan's theorem tells us that H ^ Kr holds in this case), respectively. Our 
conclusion is that Frieze and Jerrum's and Hastad's algorithms perform almost 
equally well on these graphs asymptotically. 

Another way to study dense graphs is via random graphs. Let G{n,p) denote 
the random graph on n vertices in which every edge is chosen randomly and 
independently with probability p = p{n). We say that Q{n,p) has a property A 
asymptotically almost surely (a.a.s.) if the probability it satisfies A tends to 1 as 
n tends to infinity. Here, we let p = c for some < c < 1. 

For G G Q{n,p) it is well known that a.a.s. lo{G) assumes one of at most 
two values around in{i/p) [5,30]. It is also known that, almost surely x(G) ~ 

2in(rap) (i^) ' B.S np oo [4,29]. Let us say that x(G') is concentrated in 
width s if there exists u = u{n,p) such that a.a.s. u < x{G) < u + s. Alon 
and Krivelevich [2] have shown that for every constant (5 > 0, if p = n^^/^^'' 
then x(G) is concentrated in width s = 1. That is, almost surely, the chromatic 
number takes one of two values. 



Proposition 25. Let H S G{n,p). When np oo, FJm,{H) 
^ - ^here m = uj{H). Ha{H) =p- £ + (1 -p) - 



+ 



2 I 2 In m 
pc 



m 



Proof. Let k = x{H). 

FJm{H)>am-s{Km,Kk)^ 
(m^ — m + 21nm)(m — l)fc 



1 21nm 
+ 



m 



ri^ log n log n * 



A;(/;;- 1) 



A; 



m3(fc- 1) 
2fc 



2A:ln5 



2fclin 



fc — 1 m{k — 1) TO2(fc 
If n is large, then m and 

1 



(**) 



2 21nm 
1 + 5- + 



1) m^{k-l) m3(fc-l) 



21nm 



(**) 



The expected number of edges for a graph H G Q{n,p) is {^^p, so 



= 1 



= 1 - 



(n^ — n)p 



(1- 



log n 



) = 1- 



n — pn + p 



Pi 
(1- 



log n 



) = 



l-{l-p 
P'n? log n + nc 



-) 



log n ' 
pnc — p-n? log n+pc 



n 



■P) 



log n 
c 
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pc 



V? log n log n 



□ 



We see that, in the limiting case, Ha{H) tends to p, while FJm{H) tends to 
1. Again, this means that, for large enough graphs, we cannot do much better. 
With a better analysis, one could possibly reach an expression for FJm{H) that 
has a faster convergence rate. 

Of course, it is interesting to look at what happens for graphs H e Q{n,p) 
where np does not tend to oo when n —^ oo. The following theorem lets us do 
this. 

Theorem 26 (Erdos and Renyi [13]). Letc be a positive constant andp = ^. 
If c < I, then a.a.s. no component in G{n,p) contains more than one cycle, and 
no component has more than ^ vertices. 

Now we see that if np ^ e when n — > oo and < e < 1, then G{n,p) almost 
surely consists of components with at most one cycle. Thus, each component 
resembles a cycle where, possibly, trees are attached to certain cycle vertices, and 
each component is homomorphically equivalent to the cycle it contains. Since we 
know from Section 3.1 that Frieze and Jcrrum's algorithm performs better than 
Hastads algorithm on cycle graphs, it follows that the same relationship holds 
in this part of the G{n,p) spectrum. 



4 Conclusions and Open Problems 

We have seen that applying Frieze and Jerrum's algoritiim to Max JT-Col gives 
comparable to or better results than when applying Hastad's Max 2-Csp al- 
gorithm for the classes of graphs we have considered. One possible explanation 
for this is that the analysis of the Max 2-Csp algorithm only aims to prove 
it better than a random solution on expectation, which may leave room for 
strengthenings of the approximation guarantee. At the same time, we are proba- 
bly overestimating the distance between the graphs. It is likely that both results 
can be improved. 

Kaporis et al. [23] have shown that mc2 is approximable within .952 for 
any given average degree d and asymptotically almost all random graphs G in 
Q{n,m = where G{n,m) is the probability space of random graphs on 

n vertices and m edges selected uniformly at random. In a similar vein, Coja- 
Oghlan et al. [8] give an algorithm that approximates mck within 1 — 0(1/ ^Jrvp) 
in expected polynomial time, for graphs from Q{n,p). It would be interesting to 
know if these results could be carried further, to other graphs G, so that better 
approximability bounds on Max H-Col, for H such that G H, could be 
achieved. 

Erdos [12] has proved that for any positive integers k and I there exists 
a graph of chromatic number k and girth at least I. It is obvious that such 
graphs cannot be sandwiched between K2 and a cycle as was the case of the 
graphs of high girth in Section 3.1. A different idea is thus required to deal with 
those graphs. In general, to apply our method more precisely, we need a better 
understanding of the structure of Cs and how this interacts with our metric d. 

The idea of defining a metric on a space of problems which relates their 
approximability can be extended to more general cases. It should not prove too 
difficult to generalise the framework introduced in this paper to Max CSP over 
directed graphs or even languages consisting of a single, finitary relation. How 
far can this generalisation be carried out? Could it provide any insight into the 
approximability of Max CSP on arbitrary constraint languages? 

When considering inapproximability, we have strongly relied on the Unique 
Games Conjecture — hence, we are part of the growing body interested in seeing 
UGC settled. We note, though, that weaker inapproximability results exist for 
both Max cut [19] and Max fc-cuT [22] and that they are apphcable in our 
setting. We want to emphasise that our method is not per se dependent on the 
truth of the UGC. 
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